Back

JMIR Medical Informatics

16 training papers 2019-06-25 – 2026-03-07

Top medRxiv preprints most likely to be published in this journal, ranked by match strength.

1
Creating an Indexing Scheme for Case Series Articles
2025-12-29 health informatics 10.64898/2025.12.19.25342712
Top 0.1% (4.7%)
Show abstract

ObjectivesCase reports and case series comprise a significant portion of the biomedical literature, yet unlike case reports, the National Library of Medicine does not index case series as a Publication Type. This hurts clinicians and researchers ability to retrieve, identify and analyze evidence from this type of study. Materials and MethodsPubMed articles mentioning "case series" in title or abstract were characterized to learn what are considered to be case series by the authors themselves. W...

2
Single-label and Multi-label Classification for Disease Recognition with Special Consideration of Comorbidities
2025-12-31 health informatics 10.64898/2025.12.23.25342901
Top 0.1% (4.3%)
Show abstract

Certain diseases require rapid treatment to avoid long-term consequences for patients. However, they may be difficult to recognize, especially if the symptoms are ambiguous and compatible with multiple possible diagnoses. Completing all necessary examinations often takes time, thereby prolonging patient suffering. Data-driven approaches, such as single-label classification (SLC) and multi-label classification (MLC), can help accelerate the diagnostic process and improve accuracy. These two appro...

3
Development and validation of an algorithm to identify front-line clinicians using EHR audit log data
2026-02-16 health informatics 10.64898/2026.02.13.26346268
Top 0.2% (4.0%)
Show abstract

BackgroundInterprofessional teams are central to high quality patient care. However, identifying the clinician primarily responsible for a patient requires labor-intensive methodologies. Although electronic health record (EHR) audit logs offer a scalable alternative, its use for identifying frontline clinicians is underdeveloped. ObjectiveTo develop and validate an algorithm utilizing EHR audit logs to identify the primary frontline clinician per patient day of an encounter and to describe care...

4
Development of a Transformer-Based Cardiac Arrest Prediction Model for General Ward Patients
2026-01-13 health informatics 10.64898/2026.01.12.26343973
Top 0.2% (3.9%)
Show abstract

BackgroundIn-hospital cardiac arrest on general wards is often preceded by detectable physiological deterioration, yet conventional early warning scores demonstrate limited discrimination. We developed and performed preliminary validation of a transformer-based cardiac arrest prediction system for general ward patients. MethodsThis retrospective study was conducted among general ward patients at a tertiary academic hospital in South Korea (Severance Hospital, 2013-2017). We developed Cardiac Ar...

5
Identifying Reasons for ACEI/ARB Non-Use in CKD Using Scalable Clinical NLP with Schema-Guided LLM Augmentation
2026-02-12 health informatics 10.64898/2026.02.10.26346025
Top 0.2% (3.9%)
Show abstract

IMPORTANCEAlthough angiotensin-converting enzyme inhibitors (ACEIs) and angiotensin receptor blockers (ARBs) are recommended for people with chronic kidney disease (CKD), they remain underused. Barriers to adherence, such as adverse effects or patient refusal, are frequently embedded within unstructured clinical narratives and are therefore inaccessible to structured data analytics. Scalable natural language processing (NLP) approaches are needed to identify these barriers and support guideline-...

6
Classifying polyneuropathy and myopathy patients on Electronic Health Records
2025-12-12 health informatics 10.64898/2025.12.11.25342051
Top 0.2% (3.9%)
Show abstract

BackgroundRare neuromuscular diseases such as polyneuropathy (PN) and myopathy (MY) often share symptomatic characteristics, leading to diagnostic challenges and delays. Machine learning applied to routine care data of electronic health records (EHRs) offers the potential for accelerating accurate diagnosis. ObjectiveTo develop and evaluate machine learning models to distinguish between patients with PN and MY using EHR data, as a step toward tools that could support improved diagnostic process...

7
Validation of 13,102 ICD-10-CM Codes Using a Large Language Model-Based System
2025-12-31 health informatics 10.64898/2025.12.30.25343244
Top 0.2% (3.9%)
Show abstract

ObjectiveTo comprehensively evaluate the validity of ICD-10-CM codes for both prevalent diagnoses and less common diseases, and to assess the performance of a large language model (LLM)-based system in validating these codes. Materials and MethodsThis retrospective study analyzed hospital admissions from the Medical Information Mart for Intensive Care (MIMIC-IV) database. We developed a validated LLM-based system using GPT-4o, refined through iterative prompt engineering, to assess ICD-10-CM co...

8
Can Machine Learning Algorithms use Contextual Factors to Detect Unwarranted Clinical Variation from Electronic Health Record Encounter Data during the Treatment of Children Diagnosed with Acute Viral Pharyngitis
2026-03-02 health informatics 10.64898/2026.02.23.26346757
Top 0.3% (3.8%)
Show abstract

Rationale, Aims and ObjectivesUnwarranted clinical variation (UCV) in patient care often arises from contextual factors and contributes to increased costs, unnecessary treatments, and deviations from evidence-based practice. Detecting UCV is challenging due to the complexity of care decisions. Current approaches rely on centralized data aggregation and mixed-effects regression, which estimate relative variation but cannot detect absolute variation. Moreover, machine learning (ML) methods leverag...

9
Variability in Automated Sepsis Case Detection: A Systematic Analysis of Implementation Methods in Clinical Data Repositories
2026-03-04 health informatics 10.64898/2026.02.27.26347259
Top 0.3% (3.8%)
Show abstract

ObjectiveTo systematically identify and characterize methodological heterogeneity in sepsis case detection methods using the MIMIC-III database or the eICU-CRD, and to quantify the resulting variability in sepsis detection rates. Materials and MethodsWe conducted a PRISMA-guided systematic review of PubMed and Web of Science (2016-2024), and stratified studies by cohort definition to obtain comparable subsets. We extracted information on sepsis case detection methodology across six domains: par...

10
Hide and Seek: Privacy-Preserving Artificial Intelligence with a Feasibility Study in Rare Disease Diagnosis
2026-01-17 health informatics 10.64898/2026.01.15.26344228
Top 0.3% (3.8%)
Show abstract

BackgroundIntegrating advanced artificial intelligence (AI) into clinical decision-support often requires the sharing of sensitive patient data with external services, raising privacy concerns. Homomorphic encryption (HE) allows computing directly on encrypted data, without revealing the underlying patient information. ObjectivesTo develop a large language model (LLM)-assisted diagnosis framework while preserving patient privacy in the clinical text analysis, by leveraging HE and using rare dis...

11
Machine Learning for Urinary Tract Infection Prediction in Emergency Departments: An Explainable Approach
2025-12-18 health informatics 10.64898/2025.12.13.25342059
Top 0.3% (3.8%)
Show abstract

Urinary tract infections (UTIs) represent a substantial burden in emergency department (ED) settings, where diagnostic delays and the limitations of traditional clinical assessments often result in suboptimal treatment decisions. This study develops an interpretable machine learning framework to enhance real-time UTI prediction accuracy. We analyzed a retrospective dataset of 80,387 ED patient encounters from four institutions (2013-2016), encompassing 220 clinical variables. Four machine learn...

12
Decentralized Patient-Centric Medical Image Exchange: A Hybrid Blockchain and DICOMweb Architecture
2025-12-30 health informatics 10.64898/2025.12.29.25343172
Top 0.4% (3.8%)
Show abstract

The fragmentation of medical imaging data across isolated Picture Archiving and Communication Systems (PACS) creates significant barriers to interoperability. This paper presents a functional Proof of Concept (PoC) for a decentralized, patient-centric medical image exchange system. By combining an Ethereum-based smart contract layer for access control and settlement with an off-chain Node.js "Worker" bridge, the system enables automated, peer-to-peer transfer of DICOM studies between disparate O...

13
Transformer-based structuring of Italian electronic health records with application in cardiac settings
2026-01-23 health informatics 10.64898/2026.01.22.26344603
Top 0.5% (3.7%)
Show abstract

PurposeNatural Language Processing (NLP) has the potential to extract structured clinical knowledge from unstructured Electronic Health Records (EHRs). However, the limited availability of annotated datasets for algorithm training restricts its application in clinical practice. This study investigates the use of transformer-based NLP models to structure Italian EHRs in cardiac settings, addressing this gap. MethodsWe implemented and evaluated three named entity recognition algorithms: SpaCy, Fl...

14
Build fair machine learning models to predict adverse outcomes for Heart failure patients with preserved ejection fraction (HFpEF) and with reduced ejection fraction (HFrEF)
2025-12-19 health informatics 10.64898/2025.12.18.25342417
Top 0.5% (3.7%)
Show abstract

BackgroundHeart failure (HF), including heart failure with preserved ejection fraction (HFpEF) and heart failure with reduced ejection fraction (HFrEF), remains a major global health challenge, particularly among aging populations. Timely and accurate prediction of severe adverse outcomes associated with HF is critical for optimizing care, reducing disease burden, and improving outcomes. Although social determinants of health (SDoH) have been recognized as key drivers of HF disparities and assoc...

15
Patient-Centric Markov-Chain Framework for Predicting Medication Adherence Using De-Identified Data
2026-02-10 health informatics 10.64898/2026.02.08.26345856
Top 0.6% (3.7%)
Show abstract

Long-term adherence to prescribed therapies remains a persistent challenge in chronic and ultra-rare conditions where clinical outcomes depend on continuous medication use. Even brief gaps in therapy can compromise disease control, yet patients frequently encounter structural barriers including high out-of-pocket costs, prior-authorization (PA) delays, annual re-verification cycles, and refill logistics that disrupt persistence. This study evaluates a patient-centric Markov-chain framework for a...

16
Development and Validation of the Intensive Documentation Index for ICU Mortality Prediction: A Temporal Validation Study
2026-02-12 health informatics 10.64898/2026.02.10.26345827
Top 0.6% (3.7%)
Show abstract

BackgroundNursing documentation patterns may reflect patient acuity and clinical deterioration, yet their prognostic value remains underexplored. We developed the Intensive Documentation Index (IDI), a novel framework quantifying temporal documentation rhythms, and evaluated its ability to enhance ICU mortality prediction.58 MethodsWe analyzed 26,153 ICU admissions of heart failure patients from the MIMIC-IV database (2008-2019). Nine IDI features capturing documentation rhythm, volume, and sur...

17
Data-Driven Hybrid Model of SARIMA-CNNAR For Tuberculosis Incidence Time Series Analysis in Nepal
2026-02-24 health informatics 10.64898/2026.02.22.26346853
Top 0.6% (3.7%)
Show abstract

BackgroundTuberculosis (TB) remains a major public health challenge in Nepal, with incidence rates substantially higher than global estimates. Accurate forecasting of TB incidence is essential for early warning systems, resource allocation, and targeted interventions. This study aimed to develop and validate a hybrid Seasonal Autoregressive Integrated Moving Average (SARIMA) and Convolutional Neural Network Auto-Regressive (CNNAR) model for TB incidence forecasting in Nepal. MethodsMonthly TB i...

18
The Causal Impact of Natural Language Processing-Driven Clinical Decision Support on Sepsis Mortality in England: An Augmented Synthetic Control Analysis of NHS Trust-Level Data
2026-03-02 health informatics 10.64898/2026.02.27.26347253
Top 0.6% (3.6%)
Show abstract

BackgroundSepsis remains a leading cause of preventable hospital mortality in England, with NHS England reporting over 48,000 sepsis-related deaths annually. Natural language processing (NLP)-driven clinical decision support systems (CDSS) have been deployed in several NHS Trusts to enable automated early detection of sepsis from unstructured clinical notes, yet causal evidence of their effectiveness at the hospital level remains limited. ObjectiveTo estimate the causal effect of implementing N...

19
SHAPE AI: Development and Expert Validation of a Survey for Human AI Performance Evaluation in Healthcare
2026-01-21 health informatics 10.64898/2026.01.18.26344350
Top 0.6% (3.5%)
Show abstract

ObjectiveTo develop and content-validate a brief, expert-informed Survey for Human-AI Performance Evaluation (SHAPE-AI) for near-real-time assessment of how clinical AI affects human performance. BackgroundAI-enabled clinical decision support can improve outcomes only when aligned with clinician workflows, and cognitive demands. Existing evaluations measure technical performance and adoption, providing limited assessment of how AI shapes human performance. There is a lack of concise, operationa...

20
CLEAR: An Auditable Foundation Model for Radiology Grounded in Clinical Concepts
2026-01-17 health informatics 10.64898/2026.01.15.26344222
Top 0.7% (3.2%)
Show abstract

"Black box" deep learning models for medical image interpretation limit clinical trust and analysis of performance degradation. Here, we introduce Concept-Level Embeddings for Auditable Radiology (CLEAR), an auditable foundation model based on clinical concepts. Trained on over 0.87 million image-report pairs from 239,091 patients, CLEAR learns a visual representation and projects chest X-rays into a semantically rich space defined by large language model embeddings, making every prediction trac...